Q Learning based Reinforcement Learning Approach to Bipedal Walking Control

Authors

  • Sudhir Raj
  • Cheruvu Siva Kumar
Abstract

Reinforcement learning has been an active research area in recent years, not only in machine learning but also in control engineering, operations research, and robotics. It is a model-free learning control method that can solve Markov decision problems. Q-learning is an incremental dynamic programming procedure that determines the optimal policy step by step. It is an online procedure that learns the optimal policy from experience, using only sampled transitions. This paper presents Q-learning-based reinforcement learning of a double inverted pendulum, which reaches a limit cycle after several learning cycles. The double inverted pendulum becomes stable, since the pole angle and pole angular velocity go to zero. Stabilization of an equivalent double inverted pendulum representing a bipedal robot has been implemented successfully, with Q-learning keeping the pole angles within the required range.

Keywords: Q-learning; double inverted pendulum; limit cycle.
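To make the step-by-step, sample-based nature of Q-learning concrete, here is a minimal tabular sketch. It is not the paper's double inverted pendulum controller: the environment is a hypothetical five-state chain, and the function names (`step`, `q_learning`, `greedy_policy`) and the hyperparameters (`alpha`, `gamma`, `epsilon`) are assumptions chosen only for illustration. It uses the standard update Q(s, a) ← Q(s, a) + α [r + γ max Q(s', ·) − Q(s, a)].

```python
import random


def step(state, action, goal):
    """Toy chain dynamics (an assumption, not the pendulum model).

    Action 0 moves left, action 1 moves right. Reaching `goal` gives
    reward 1 and ends the episode; every other transition gives 0.
    """
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == goal
    return next_state, (1.0 if done else 0.0), done


def q_learning(n_states=5, n_actions=2, episodes=500,
               alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on steps per episode
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice(
                    [a for a, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, action, goal)
            # The target uses only the sampled transition: no model needed.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [row.index(max(row)) for row in q]


if __name__ == "__main__":
    table = q_learning()
    print(greedy_policy(table))
```

For a pendulum, the continuous pole angles and angular velocities would first have to be discretized into table indices; that step is omitted here because the abstract does not describe how it was done.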


Similar resources

Dynamic bipedal walking assisted by learning

This thesis proposes a general control architecture for 3D dynamic walking. It is based on a divide-and-conquer approach that is assisted by learning. No dynamic models are required for the implementation. In the approach, the walking task is divided into three subtasks: 1) to maintain body height; 2) to maintain body posture; and 3) to maintain gait stability. The first two subtasks can be ach...


Feedback Control For Cassie With Deep Reinforcement Learning

Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully aware of many details, including torque limits, joint limits, and other non-linearities that are necessarily excluded from the...


Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic

In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...


A dynamically-Balanced Walking Biped

Describes the mechanical, electronic and software design of a 10-DOF bipedal robot which has been constructed to study control, parameterisation and automatic expansion of the stability envelope of a complex real-time behaviour, namely, dynamically-balanced two-legged walking. The machine is physically complete and demonstrates reasonable reliability in movement control including dynamically-ba...


DOCTORAL THESIS PROPOSAL Biped Locomotion: Augmenting an Intuitive Control Algorithm with Learning

Foot placement is a key determinant for the stabilization of walking speed and lateral motion of a biped. However, there is no closed-form expression for the foot placement parameters in terms of the walking speed or other gait parameters. A simple and intuitive control algorithm (called “Turkey Walking”) based on Virtual Model Control (VMC) was successfully applied to planar bipedal walking. Ho...




Publication date: 2013